Running Head: DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION DIFFICULTIES IN AUTOMATIC SPEECH RECOGNITION OF DYSARTHRIC SPEAKERS AND THE IMPLICATIONS FOR SPEECH-BASED APPLICATIONS USED BY THE ELDERLY: A LITERATURE REVIEW

نویسندگان

  • Victoria Young
  • Alex Mihailidis
چکیده

Automatic speech recognition is being used in a variety of assistive contexts, including home computer systems, mobile telephones, and various public and private telephony services. Despite their growing presence, commercial speech recognition technologies are still not easily employed by individuals who have speech or communication disorders. While speech disorders in older adults are common, there has been relatively little research on automatic speech recognition performance with older adults. However, research findings suggest that the speech characteristics of the older adult may, in some ways, be similar to dysarthric speech. Dysarthria, a common neuro-motor speech disorder, is particularly useful for exploring automatic speech recognition performance limitations because of its wide range of speech expression. This paper presents a review of the clinical research literature examining the use of commercially available speech-to-text automatic speech recognition technology by individuals with dysarthria. The main factors that limit automatic speech recognition performance with dysarthric speakers are highlighted and then extended to the elderly using a specific example of a novel, automated, speech-based personal emergency response system for older adults.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a noisy-channel model of dysarthria in speech recognition

Modern automatic speech recognition is ineffective at understanding relatively unintelligible speech caused by neuro-motor disabilities collectively called dysarthria. Since dysarthria is primarily an articulatory phenomenon, we are collecting a database of vocal tract measurements during speech of individuals with cerebral palsy. In this paper, we demonstrate that articulatory knowledge can re...

متن کامل

تخمین سریع ضرایب پیچش در هنجارسازی طول مجرای صوتی با استفاده از امتیاز به دست آمده از مدلسازی تشخیص جنسیت

The performance of automatic speech recognition (ASR) systems is adversely affected by the variations in speakers, audio channels and environmental conditions. Making these systems robust to these variations is still a big challenge. One of the main sources of variations in the speakers is the differences between their Vocal Tract Length (VTL). Vocal Tract Length Normalization (VTLN) is an effe...

متن کامل

Maximum Likelihood Linear Regression (MLLR) for ASR Severity Based Adaptation to Help Dysarthric Speakers

Automatic speech recognition (ASR) for dysarthric speakers is one of the most challenging research areas. The lack of corpus for dysarthric speakers makes it even more difficult. The speaker adaptation (SA) is an alternative solution to overcome the lack of dysarthric speech and enhance the performance of ASR. This paper introduces the Severity-based adaptation, using small amount of speech dat...

متن کامل

Automatic dysfluency detection in dysarthric speech using deep belief networks

Dysarthria is a speech disorder caused by difficulties in controlling muscles, such as the tongue and lips, that are needed to produce speech. These differences in motor skills cause speech to be slurred, mumbled, and spoken relatively slowly, and can also increase the likelihood of dysfluency. This includes nonspeech sounds, and ‘stuttering’, defined here as a disruption in the fluency of spee...

متن کامل

Automatic recognition of dutch dysarthric speech: a pilot study

This paper describes a feasibility study into automatic recognition of Dutch dysarthric speech. Recognition experiments with speaker independent and speaker dependent models are compared, for tasks with different perplexities. The results show that speaker dependent speech recognition for dysarthric speakers is very well possible, even for higher perplexity tasks.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010